- Home
- Search Results
- Page 1 of 1
Search for: All records
-
Total Resources3
- Resource Type
-
0002000001000000
- More
- Availability
-
21
- Author / Contributor
- Filter by Author / Creator
-
-
Kuze, Tatsuki (3)
-
Gomez, Alfredo (2)
-
Schofield, Alexandra (2)
-
Babb, Simon (1)
-
Bayard de Volo, Theo (1)
-
Bayard_de_Volo, Theo (1)
-
Carothers, Morgan (1)
-
Celeste, Mia (1)
-
Gardi, Joseph (1)
-
Gross, Gianluca (1)
-
Harris, Dana (1)
-
Lee, Taeyun (1)
-
Liu, Nuo (1)
-
Mimno, David (1)
-
Plunkett, Fiona (1)
-
Qian, Julia (1)
-
Sultana, Sharifa (1)
-
Wu, Ingrid (1)
-
Wu, Siqi (1)
-
Wu, Yi-Chieh (1)
-
- Filter by Editor
-
-
null (1)
-
& Spizer, S. M. (0)
-
& . Spizer, S. (0)
-
& Ahn, J. (0)
-
& Bateiha, S. (0)
-
& Bosch, N. (0)
-
& Brennan K. (0)
-
& Brennan, K. (0)
-
& Chen, B. (0)
-
& Chen, Bodong (0)
-
& Drown, S. (0)
-
& Ferretti, F. (0)
-
& Higgins, A. (0)
-
& J. Peters (0)
-
& Kali, Y. (0)
-
& Ruiz-Arias, P.M. (0)
-
& S. Spitzer (0)
-
& Sahin. I. (0)
-
& Spitzer, S. (0)
-
& Spitzer, S.M. (0)
-
-
Have feedback or suggestions for a way to improve these results?
!
Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Practitioners dealing with large text collections frequently use topic models such as Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF) in their projects to explore trends. Despite twenty years of accrued advancement in natural language processing tools, these models are found to be slow and challenging to apply to text exploration projects. In our work, we engaged with practitioners (n=15) who use topic modeling to explore trends in large text collections to understand their project workflows and investigate which factors often slow down the processes and how they deal with such errors and interruptions in automated topic modeling. Our findings show that practitioners are required to diagnose and resolve context-specific problems with preparing data and models and need control for these steps, especially for data cleaning and parameter selection. Our major findings resonate with existing work across CSCW, computational social science, machine learning, data science, and digital humanities. They also leave us questioning whether automation is actually a useful goal for tools designed for topic models and text exploration.more » « lessFree, publicly-accessible full text available January 10, 2026
-
Babb, Simon; Celeste, Mia; Harris, Dana; Wu, Ingrid; Bayard de Volo, Theo; Gomez, Alfredo; Kuze, Tatsuki; Lee, Taeyun; Mimno, David; Schofield, Alexandra (, WeCNLP (West Coast NLP) Summit)
-
Carothers, Morgan; Gardi, Joseph; Gross, Gianluca; Kuze, Tatsuki; Liu, Nuo; Plunkett, Fiona; Qian, Julia; Wu, Yi-Chieh (, 11th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM-BCB 2020))null (Ed.)
An official website of the United States government

Full Text Available